TREC-2 Routing and Ad-Hoc Retrieval Evaluation using the INQUERY System
نویسندگان
چکیده
The ARPA TIPSTER project, which is the source of the data and funding for TREC, has involved four sites in the area of text retrieval and routing. The TIPSTER project (which includes MCC as a subcontractor), has focused on the following goals: Improving the eeectiveness of information retrieval techniques for large, full-text databases, Improving the eeectiveness of routing techniques appropriate for long-term information needs, and Demonstrating the eeectiveness of these retrieval and routing techniques for Japanese full text databases 4]. Our general approach to achieving these goals has been to use improved representations of text and information needs in the framework of a new model of retrieval. This model uses Bayesian networks to describe how text and queries should be used to identify relevant documents 6, 3, 7]. Retrieval (and routing) is viewed as a probabilistic inference process which compares text representations based on diierent forms of linguistic and statistical evidence to representations of information needs based on similar evidence from natural language queries and user interaction. Learning techniques are used to modify the initial queries both for short-term and long-term information needs (relevance feedback and routing, respectively). This approach (generally known as the inference net model and implemented in the INQUERY system) emphasizes retrieval based on combination of evidence. Diierent text representations (such as words, phrases, paragraphs, or manually assigned keywords) and diierent versions of the query (such as natural language and Boolean) can be combined in a consistent probabilistic framework. This type of \data fusion" has been known to be eeective in the information retrieval context for a number of years, and was one of the primary motivations for developing the inference net approach. Another feature of the inference net approach is the ability to capture complex structure in the network representing the information need (i.e. the query). A practical consequence
منابع مشابه
Recent Experiments with INQUERY
Past TREC experiments by the University of Massachusetts have focused primarily on ad-hoc query creation. Substantial eeort was directed towards automatically translating TREC topics into queries, using a set of simple heuristics and query expansion. Less emphasis was placed on the routing task, although results were generally good. The Spanish experiments in TREC-3 concentrated on simple index...
متن کاملDocument Retrieval and Routing Using the INQUERY System
The INQUERY retrieval and routing system, which is based on the Bayesian inference net retrieval model, has been described in a number of papers 5, 4, 10, 11]. In the TREC experiments this year, a number of new techniques were introduced for both the ad-hoc retrieval and routing runs. In addition, experiments with Spanish retrieval were carried out. For the ad-hoc retrieval experiments, the maj...
متن کاملCombining Evidence for Information Retrieval
This study investigated the effect on retrieval performance of two methods of combination of multiple representations of TREC topics. Five separate Boolean queries for each of the 50 TREC routing topics and 25 of the TREC ad hoc topics were generated by 75 experienced online searchers. Using the INQUERY retrieval system, these queries were both combined into single queries, and used to produce ...
متن کاملCOMBINING THE EVIDENCE OF MULTIPLE QUERY REPRESENTATIONS FOR INFORMATION RETRIEVAL l N.J. BELKIN and P. KANTOR
We report on two studies in the TREC-2 program that investigated the effect on retrieval performance of combination of multiple representations of TREC topics. In one of the projects, five separate Boolean queries for each of the 50 TREC routing topics and 25 of the TREC ad hoc topics were generated by 75 experienced online searchers. Using the INQUERY retrieval system, these queries were both ...
متن کاملCombining the Evidence of Multiple Query Representations for Information Retrieval
We report on two studies in the TREC-2 program which investigated the effect on retrieval performance of combination of multiple representations of TREC topics. In one of the projects, five separate Boolean queries for each of the 50 TREC routing topics and 25 of the TREC ad hoc topics were generated by 75 experienced online searchers. Using the INQUERY retrieval system, these queries were both...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1993